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Bats are connected with the increasing numbers of emerging and re-emerging viruses that may break the species barrier and 
spread into the human population. Coronaviruses are one of the most common viruses discovered in bats, which were considered 
as the natural source of recent human-susceptible coronaviruses, i.e. SARS-COV and MERS-CoV. Our previous study reported 
the discovery of a bat-derived putative cross-family recombinant coronavirus with a reovirus gene plO, named as Ro-BatCoV 
GCCDC1. In this report, through a two-year follow-up of a special bat population in one specific cave of south China, we 
illustrate that Ro-BatCoV GCCDC1 persistently circulates among bats. Notably, through the longitudinal observation, we 
identified the dynamic evolution of Ro-BatCoV GCCDC1 in bats represented by continuously recombination events. Our study 
provides the first glimpse of the virus evolution in one longitudinally observed bat population cohort and underlines the 
surveillance and pre-warning of potential interspecies transmittable viruses in bats. 
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INTRODUCTION 

Bats (Order Chiroptera) are thepg only group of mammals 
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capable of flight with the ability to circumnavigate at night, i. 
e., nocturnal animals (Calisher et ah, 2006). Bats have been 
the focus of particular interest not just because of their 
ecological interest, but also because of their medical im¬ 
portance in public health, as they play significant role in 
evolving and transmission of zoonotic viral agents that 
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stance a substantial danger to human and animal well-beings. 
Bats possess special peculiarities and features which make 
them host to different viruses, such features including 
structural diversity and complex lifestyle, diverse mode of 
feeding ranging from insectivores (feed on insects), frugi- 
vore (fruit eater), nectarivore (sucking nectar), palynivore 
(eating pollen), piscivore (eating fish) while others feed on 
arthropods, vertebrates or practice haematophagy (feed on 
blood). Bats are social animals that live in a densely popu¬ 
lated roost, demographic and spatial structure of population 
with long life, which may significantly contribute to main¬ 
taining viruses as well as the high possibility of intra- and 
inter-species transmission of viral infections (Calisher et al., 
2006). In addition, excreta during migration may be an im¬ 
portant factor in the spread of viruses (Lopez-Roig et al., 
2014). 

Bats are connected with the increasing numbers of emer¬ 
ging and re-emerging viruses including Hendra virus (HeV), 
Nipah virus (NiV) and Ebola virus (EBOV), and also cor- 
onaviruses, etc. (Li et al., 2016). The studies of bats as re¬ 
servoirs for a number of mammalian coronaviruses came to 
the spotlight since the discovery of severe acute respiratory 
syndrome (SARS)-like coronaviruses in 2005 isolated from 
south China (Lau et al., 2005; Li et al., 2005), and subse¬ 
quently, a SARS-CoV receptor ACE2-using coronavirus was 
isolated from bat (Ge et al., 2013). Sister to SARS-CoV was 
the middle east respiratory syndrome-associated coronavirus 
(MERS-CoV) emerged in 2012, the close-related viruses of 
which was also found in bats (Lelli et al., 2013; Zumla et al., 
2016). Actually, a number of bat coronaviruses have been 
documented in recent years (Hu et al., 2017), and the MERS- 
CoV phylogenetically closely related bat coronaviruses 
(BatCoVs) HKU4 can also recognize human CD26 for cell 
entry, indicating origin of human-susceptible coronaviruses 
from bats (Wang et al., 2014). 

Coronavirus encrypts a non-segmented, largest positive 
sense single-stranded RNA genome known ranging from 27 
to 32 kb in length (Lai, 1996; Sokolova et al., 1996; Du et al., 
2016). In the subfamily of Coronaviridae, Alphacoronavirus 
and Betacoronavirus infect and cause diseases in mammals, 
while Gammacoronavirus and Deltacoroanvirus are mostly 
confined to avian infections and diseases (Jackwood et al., 
2012; Woo et al., 2012). Report has shown that coronaviruses 
possess low proofreading capability that can also exhibit 
comparatively high mutation and recombinant rates, which 
make them one of the most diverse, genetically distinct, and 
recently emerging groups of viruses (Minskaia et al., 2006). 
The continuous mutation and recombination of the cor¬ 
onaviruses probably bridge the species barrier and enable 
them to spread into the human population (Baric et al., 1995). 

Recently, we discovered a novel Rousettus bat coronavirus 
GCCDC1 (Ro-BatCoV GCCDC1) with full genome se¬ 
quenced, which exhibits a putative heterologous re¬ 


combination event between a bat coronavirus and a bat 
orthoreovirus (Huang et al., 2016). We found that the plO 
gene of bat orthoreovirus in Ro-BatCoV GCCDC1 genome 
has no homologous gene from any parental strain of cor¬ 
onaviruses. However, there is dearth of information whether 
this virus with the recombinant gene can persist and evolve 
in their hosts. 

Herein we carried out a longitudinal survey on one po¬ 
pulation of Roussetus leschenaulti bats in Yunnan Province, 
Mainland China, between 2014 and 2015. The persistence of 
the Ro-BatCoV GCCDC 1 in the bat population and its po¬ 
tential evolution was investigated. Our study provides the 
first glimpse of the virus evolution in one prospectively and 
longitudinally observed bat population. 

RESULTS 

The persistent prevalence of the Ro-BatCoV GCCDC 1 
in the bats 

A total of 568 rectal swab samples from R. leschenaulti bats 
were collected between May 2014 (/7=118), October 2014 
(/?=270) and October 2015 (/?=180) in one cave located in 
Xishuangbanna, Yunnan Province, China (Table 1). The anal 
swab samples were screened individually for the presence of 
coronavirus using RT-PCR detection method. Coronavirus 
positive ratios were 39.8% (47/118), 38.8% (70/270) and 
35.6% (64/180) in bats sampled in May 2014, October 2014 
and October 2015, respectively (Table 1). These results 
showed a persistent prevalence of Ro-BatCoV GCCDC 1 in 
this bat population. 

The evolution of RdRp from Ro-Bat GCCDC 1 

Based on the identified PCR CoV-positive bat samples, the 
PCR amplicons with the length of 228 bp of Ro-BatCoV 
GCCDC 1 RdRp were sequenced, which showed the con¬ 
servation of this newly identified bat CoV. We analyzed the 
RdRp sequences which had the corresponding plO sequences 
available, thus we have 97 pairs of RdRp and plO from 97 bat 
specimens. The phylogenetic analysis of the RdRp sequences 
revealed two groups of Ro-BatCoV GCCDC 1 designated 
Group I (RdRp-I) and Group II (RdRp-II) (Figure 1A). The 
result from our study revealed that all the Ro-BatCoV 
GCCDC 1 RdRp from the first round sampling in May 2014 
belonged to the RdRp-I, while subsequent sampling in Oc¬ 
tober 2014 and 2015, respectively, showed the involvement 
of viruses featured with RdRp-II. 

The evolution of plO gene in Ro-BatCoV GCCDC 1 

Based on the result of PCR amplification using specific 
primers, the nucleic acid sequences of plO genes (420 bp) of 
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Table 1 Summary of sampling detail and coronavirus infections in bats a) 


Sampling time 

Bats with swab 
samples collected 

Bats with CoV 
positive (%) 

May 2014 

118 

47 (39.8%) 

October 2014 

270 

70 (38.8%) 

October 2015 

180 

64 (35.6%) 


a) *, All the positive samples were defined based on the PCR amplicons 
with the length of 228 bp of Ro-BatCoV GCCDC1 RdRp as described in 
Table 2. 97 RdRp sequences out of the 181 samples which had the corre¬ 
sponding plO sequences were utilized for the GCCDC evolution. 


A 


# May.2014 
Oct.2014 

• Oct.2015 



I 


IT 



Figure 1 Phylogenetic analyses of Ro-BatCoV GCCDC 1 for three 
sampling periods. Evolutionary trees of RdRp (A) and PlO (B) constructed 
for the samples collected from May, 2014 to Oct., 2015 has indicated by 
different colors as labeled. The maximum-likelihood trees were constructed 
with 1,000 replication available in MGEA6. 


Ro-BatCoV GCCDC 1 were subjected to phylogenetic ana¬ 
lysis, which displayed high level of sequence similarity to 
Ro-BatCoV GCCDC 1 plO genome segments (Huang et al., 
2016). In pairwise comparison of nucleotide sequence, the 
nucleotide sequence identity was 99% compared to Ro- 
BatCoV GCCDC 1. However, based on the phylogenetic 
analysis of all the available Ro-BatCoV GCCDC 1 in public 
domain, the plO genes were divided into two groups: Group 
(A) and Group (B) (Figure IB). All the three times of sam¬ 
pling Ro-BatCoV GCCDC 1 covered both Group A and 
Group B, which indicated the persistence of the groups of 
plO in Ro-BatCoV GCCDC 1. 

When compared to plO genes from traditional orthor- 
eoviruses (Figure 2), the plO from the typical six Ro-BatCoV 
GCCDC 1: GCCDC1/2-53 and GCCDC1/2-110 sampled in 
October 2014, and GCCDC1/3-19 and GCCDC1/3-31 in 
October 2015 and our previously identified Ro-BatCoV 
GCCDC 1 strains 346-KU762337 and Ro-BatCoV GCCDC1 
strains 356-KU762338, are clustered into one independent 


branch indicating the uniqueness of the bat coronavirus-de- 
rived plO gene. However, we can still observe that plO genes 
in 2-53 and 3-31 belong to Group A and plO of other four 
samples are in Group B. 

The potential dynamic recombination of Ro-BatCoV 
GCCDC1 

To further investigate the dynamic evolution of Ro-BatCoV 
GCCDC 1 in the longitudinally collected bat samples, we 
further analyzed the potential recombination of the viruses 
by constructing a model that pairs of RdRp and plO genes, 
each pair of which is sequenced from single Ro-BatCoV 
GCCDC 1 sample. Five different nonsynonymous mutations 
were found in all available RdRp protein segments (Figure 
3 A). However, only two mutations were featured by the two 
groups of RdRp (T-648 and 1-666 for RdRp-I and A-648 and 
M-666 for RdRp-II). Comparison of the translated amino 
acid sequences of plO gene of both Group A and B of Ro- 
BatCoV GCCDC 1 identified four nonsynonymous muta¬ 
tions at position 7, 14, 19 and 34, respectively (Figure 3B). 
Group A pi0 was featured with the combination of M-7, T- 
14, A-19, and 1-34, while Group B possesses substitutions V- 
7, T-14, N-19 and S-34 in the four positions. When we paired 
the corresponding RdRp and plO genes groups with their 
featured substitutions, there were four different combina¬ 
tions of the putative assembled genomes in all the bat sam¬ 
ples tested (Figure 4A). Ro-BatCoV GCCDC 1 virus IA 
possesses Group I of RdRp gene and Group A of plO gene. 
Ro-BatCoV GCCDC 1 virus IB is featured with Group I of 
RdRp gene and Group B of pi 0 gene. Ro-BatCoV GCCDC 1 
virus IIA has Group II of RdRp gene and Group A of pi 0 
gene. Ro-BatCoV GCCDC 1 virus IIB characterizes by 
Group II of RdRp gene and Group B of pi 0 gene. During the 
first round of sampling in May 2014, only viruses IA and IB 
were discovered in the bat population (Figure 4B). However, 
five months later in October 2014 IIA and IIB recombinants 
can be found in the bat swabs. Further, the third round 
sampling in October 2015 showed that while the four com¬ 
binations were persistent in the bat population, the two 
viruses IIA and IIB became more dominant. 


DISCUSSION 

In our previous study, we demonstrated that a novel cor¬ 
onavirus Ro-BatCoV GCCDC 1 encoded a putatively re¬ 
combinant nonstructural protein plO derived from reovirus, 
which induces the formation of syncytia in infected cells 
(Huang et al., 2016). Sequel to this, we now report that Ro- 
BatCoV GCCDC 1 is continuously circulating in bats from 
Yunnan Province in China. We observed high prevalence of 
the Ro-BatCoV GCCDC 1 on a consistent trend for a period 
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Figure 2 Phylogenetic relationships of Ro-BatCoV GCCDC1 compared to other viruses. Phylogenetic trees of the plO sequence of Ro-BatCoV GCCDC1 
together with the plO genes from other reoviruses. The phylograms were constructed by using MEGA6 software package to determine the maximum- 
likelihood tree for the sequence. 
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Figure 3 Multiple sequence alignment of Ro-BatCoV GCCDC1 genes based on samples from three sampling periods. Sequence alignment of deduced 
amino acid of RdRp (A) and plO proteins (B). Consensus and sequence logo for conserved amino acid residues are shown on top of the alignment. The 
residues with color indicate the position of mutation. 


of 2 years that is relatively high in proportion to the popu¬ 
lations under study. The cross-family recombinant plO gene 
stably exists in the novel virus. Notably, we observed the 
dynamic evolution of this virus in the special bat population. 
Our study provides beneficial recommendation for the sur¬ 
veillance and pre-warning of potential interspecies trans- 
mittable viruses in bats. 


The recombination events in coronavirus involving inter¬ 
change of intra-family genetic materials has been reported 
(Baric et al., 1995). In our previous study, a traditional re¬ 
ovirus gene plO located at the upstream of N protein and 
downstream of NS7a of Ro-BatCoV GCCDC1 genome 
showing the ability of coronavirus to undergo a heterologous 
inter-family recombination (Huang et al., 2016). In our 
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Figure 4 Recombination analysis of Ro-BatCoV GCCDC1. A, The re¬ 
combination model for Ro-BatCoV GCCDC1 based on the different groups 
of RdRp (Group I and Group II) and plO proteins (Group A and Group B). 
The groups are based on the different combinations of polymorphic amino 
acid residues in Ro-BatCoV GCCDC1. B, The ratios of different re¬ 
combinants of Ro-BatCoV GCCDC1 revealed two new groups, IIA and IIB 


current study, we observed a persistent prevalence Ro-Bat- 
CoV GCCDC1 in the bat population, and we also found that 
the recombinant plO is a constitutive gene in this virus. We 
did not get all the plO sequences from the RdRp positive 
samples. This may be due to a lower amplification efficiency 
of the plO primers. Thus, the deep sequencing of the whole 
genomes and also the virus isolation from the original sam¬ 
ples are still needed to elucidate more features of Ro-BatCoV 
GCCDC 1. 

The emergence of recombinant plO gene in Ro-BatCoV 
GCCDC 1 could be due to unhindered access of reoviruses in 
bat, being a prehistoric mammal which might have favoured 
viral integration and persistence. plO persistence may have 
utilized intrinsic factors mimicking ability to incorporate its 
genome into coronavirus or of the host’s cells. plO has been 
reported to share similarities with viroporins, which has the 
ability to form hydrophilic channels that allow low mole¬ 
cular weight hydrophilic molecules to cross the membrane 
thereby interrupt membrane integrity, breakdown ionic gra¬ 
dients and eventual release of essential compounds from the 
cell (Carrasco et al., 1989; Carrasco et al., 1993). This could 
lead to series of events that favor viral replication, encourage 
viral transmission and persistence of Ro-BatCoV GCCDC 1 
in bat population. Interestingly, the phylogenetic analysis 
revealed that two distinct groups of plO with featured amino 
acids persistently exist in Ro-BatCov GCCDC 1. However, 
the comparison of the two plO in the pathogenesis of bat 
coromavirus need further investigations. 

Summarily, the new Ro-BatCoV GCCDC 1 virus is not 
only persistently prevalent in the bat population, but also 
exhibits continuous evolution. Probability also exists that 
Ro-BatCoV GCCDC 1 might be under diverse selective 


pressures. The evolving bat coronaviruses could have seized 
the advantage of conserved mammalian receptors for cellular 
replication and other biological pathways that go through 
radiation over a long time which could boost the capability 
for conveyance of bat-related viruses to other animals 
(Calisher et al., 2006). Our study benefits the understanding 
of dynamic virus evolution in special bat population and 
underlines the surveillance of potential interspecies viruses 
in bats. 


MATERIALS AND METHODS 

Sample collection, storage and transportation 

Anal swab samples were collected from Rousettus lesche- 
naulti ( R . leschenaulti) bats captured on three occasions, 
May 2014, October 2014 and October 2015, in one cave 
located in Xishuangbanna, Yunnan Province, China. Rectal 
swabs were collected from each bat and placed in viral 
transport medium (VTM) in screw cap tubes, kept on cold ice 
(at 4°C to 10°C) and transported to the laboratory at local 
CDC where they were frozen in liquid nitrogen. The VTM 
containing Earle’s Balanced Salt Solution, 5% bovine albu¬ 
min, 50,000 pg mL 1 vancomycin, 50,000 pg mL" 1 amika¬ 
cin, 10,000 units mL” 1 nystatin as described in our previous 
study (Huang et al., 2016). The samples were later trans¬ 
ported to the laboratory in dry ice and stored at -80°C until 
further analysis. 

RNA extraction 

VTM containing swab samples were manipulated by cen¬ 
trifugation at 8,000 r min -1 for 5 min at 4°C using a Thermal 
centrifuge. Total RNA was extracted from 140 pL of the 
supernatants using the RNeasy mini kit (Qiagen, Germany) 
according to the manufacturer’s protocol. The RNA was 
eluted in 60 pL AVE buffer and stored at -80°C for sub¬ 
sequent use. 

Viral detection by nested-PCR 

The extracted total RNA was transcribed to cDNA using 
reverse transcriptase enzyme (Thermo, USA) and then 
screened for the presence of coronavirus RNA using pan- 
coronavirus RT-PCR degenerate primers. The primers were 
designed from a highly conserved RdRp region of cor¬ 
onaviruses (Table 2). Theoretically, this pan-coronavirus RT- 
PCR primers should amplify a 299 bp fragment of the 
polymerase gene of all coronaviruses in the first round and 
228 bp in a semi-nested PCR for the second round. If the 
samples were RT-PCR positive to coronavirus screening, 
specific plO primers (Table 2) that covered the whole plO 
gene were used to screen for the presence of reovirus plO 
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Table 2 The primers for coronaviridae gene identification and amplification 


Primer name 

Target 

gene 

Sequence (5'—>3') 

G+C (%) 

T m (°C) 

Length of the amplification 
fragments 

RdRp-OY 

RdRp 

T GTTATT GG A ACC ACG A AGTT YTAY GGN GGNT G 

42.1 

51.3 

OF+OR=299 bp 

RdRp -IF 

RdRp 

GTTTT GTT CTTAT GGGTT GGG ATTAYCCN A ART GY G A 

42.1 

52.3 

IF+OR=228 bp 

RdRp -OR 

RdRp 

TAGTAGCATCTCCGCTGCTAGTNCCNCCNGGYTT 

52.6 

53.4 


CoV-plO-OY 

plO 

GAAAGGGAGACTAAGAAGA 

42.1 

50.8 

OF+OR=715 bp 

CoV-plO-OR 

plO 

ACAGGCTATAACTGTAAATG 

35.0 

49.2 

CoV-plO-lF 

plO 

GCAGATGAGAATGACAAACCCAA 

43.5 

56.0 

IF+IR=447 bp 

CoV-plO-lR 

plO 

GCAGCAAGCAAGCAGAGCAA 

55.0 

57.4 


gene in Ro-BatCoV GCCDC1. The amplified PCR products 
were purified following electrophoresis in 1% agarose gels 
that revealed the expected amplicon in agarose gels. 

Virus isolation 

RT-PCR positive samples for coronavirus were cultured in 
human epithelial colorectal adenocarcinoma cells (CaCo-2), 
human lung carcinoma cells (A549), human epithelial type 2 
HeLa derivative cells (HEp-2), baby hamster kidney (BHK- 
21) and rhesus monkey kidney cells (LLC-mk2). Cells were 
grown in Dulbecco’s modified Eagle’s minimum essential 
medium (DMEM) supplemented with 10% or 20% fetal 
bovine serum (FBS) and incubated at 37°C with 5% C0 2 . 
The cell lines were inoculated with the supernatant of PCR 
positive samples in 24-well plates, incubated for 2 h for virus 
adsorption, 300 pL fresh cell culture medium with 2% FBS 
was added and incubated at 37°C in 5% C0 2 incubator. The 
plates were observed daily for viral cytopathic effect (CPE). 

Molecular and sequencing analysis 

The used reference sequences were downloaded from Gen- 
Bank. Multiple sequence alignment of nucleic acid se¬ 
quences was performed using ClustalW and the Geneious 
program (http://www.geneious.com). For RdRp and plO 
phylogenetic analysis, maximum-likelihood trees were con¬ 
structed with MEGA 6 (http://www.megasoftware.net), and 
1,000 bootstrap replicates were run. 
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